# Self-supervised Vision Transformer
Vit Huge Patch14 224.mae
A large-scale image feature extraction model based on Vision Transformer (ViT), pre-trained on the ImageNet-1k dataset using the self-supervised masked autoencoder (MAE) method
Image Classification
Transformers

V
timm
104
0
Vit Small Patch8 224.dino
Apache-2.0
Self-supervised image feature extraction model based on Vision Transformer (ViT), trained using the DINO method
Image Classification
Transformers

V
timm
8,904
2
Vit Base Patch8 224.dino
Apache-2.0
A vision Transformer (ViT) image feature model trained with the self-supervised DINO method, suitable for image classification and feature extraction tasks.
Image Classification
Transformers

V
timm
9,287
1
Beit Large Patch16 224 Pt22k
Apache-2.0
BEiT is a self-supervised learning model based on Vision Transformer (ViT), pretrained on the ImageNet-21k dataset for image classification tasks.
Image Classification
B
microsoft
237
2
Beit Large Patch16 224 Pt22k Ft22k
Apache-2.0
BEiT is a Vision Transformer (ViT)-based image classification model, pre-trained in a self-supervised manner on ImageNet-22k and fine-tuned on the same dataset.
Image Classification
B
microsoft
1,880
5
Beit Base Patch16 224 Pt22k
Apache-2.0
BEiT is a vision Transformer-based model pre-trained on the ImageNet-21k dataset through self-supervised learning for image classification tasks.
Image Classification
B
microsoft
2,647
3
Beit Base Patch16 224 Pt22k Ft22k
Apache-2.0
BEiT is a Vision Transformer (ViT)-based image classification model, pre-trained in a self-supervised manner on ImageNet-22k and fine-tuned on the same dataset.
Image Classification
B
microsoft
546.85k
76
Dino Vitb16
Apache-2.0
A Vision Transformer model trained using the DINO self-supervised method, based on the ViT architecture and pretrained on the ImageNet-1k dataset.
Image Classification
Transformers

D
facebook
122.46k
108
Featured Recommended AI Models